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Protein Expression and Structure Solution using specific 

Fusion Vectors 

The present invention relates to a recombinant protein 
being composed of three components (1) , (2) and (3) , a 
DNA sequence encoding such a recombinant protein,' a 
vector expressing such a recombinant protein, a host cell 
transformed with a vector, a method for producing a 
recombinant protein, and methods for purification, 
crystallization and structure elucidation of the 
recombinant protein. 



15 



20 



The first step, and perhaps the single most important 
step, in the crystallization of a macromolecule, e.g. a 
protein, is its purification. Any impurities of the 
protein solution to be used for crystallization may 
impair crystal quality or, even worse, preclude the 
formation of crystals at all. 



25 



Procedures for 



accomplishing the highest degree of 
purification possible have been under development for 
more than 200 years, and recent times have seen an 
explosion in the invention of new methods and refinement 
of old. There is a variety of methods that exemplify how 
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problems of protein purification for protein analysis or 
protein crystallization have been approached. 

One such method is fractionation with salts and other 
5 precipitants. Hereby, proteins are precipitated from a 
complex mixture (e.g. a physiological fluid) by addition 
of various concentrations of different salts. Because 
individual proteins precipitate at different salt 
concentrations, this "salting out" phenomenon provided a 
10 method for selectively precipitating, and thereby 
" purifying, unique proteins from a mixture (Morris and 
Morris, 1964, Separation methods in biochemistry, Pitman, 
London, GB) . A minor disadvantage of salt f ractionatxon 
is that protein preparations, be they supernatant s or 
15 precipitates, are left with high residuals of salt. This 
' m ay seriously interfere with the evaluation of activity 
and purity and with subsequent purification procedures 
The most common of these methods is dialysis in celluloid 
or collodian tubes. 

20 Apart from varying the concentration of a salt, proteins 
m ay be selectively precipitated and fractionated by the 
addition of a variety of organic solvents (Conn et al . , 
19 47 Crystallization of serum albumin from ethanol/water 
25 mixtures, a. Am. Chem. Sec. 69, 1753). This ^ ^lly 
carried out at subzero temperatures ranging to -30 C to 
enhance the precipitation effect and to minimize the 
denaturation of the protein. In addition to salt and 
organic solvents, other materials have been used to 
30 precipitate and fractionate a mixture of proteins. Some 
of these materials are, for example, protamine (a mixture 
of small basic proteins) and polyeneimine (a basic 
organic polymer), which apparently cross-links some 
protein via electrostatic bridges. Moreover, metal ions 
35 or organic polymers, such as polyethylene glycol (PEG) 
were extensively used for purification purposes. PEG 
seems to act as a hybrid between an alcohol and a salt 
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and their precise' properties may vary as a function of 
mean polymer length. 

Still another method of protein purification is the 
5 selection of proteins with heat or pH. pH is effective 
because most proteins exhibit pH-dependent solubility 
minima and precipitate Or even crystallize from solution 
at particular values, whereas the property of protein 
heat stability may sometimes provide a valuable 
10 purification step. 

Other protein purification methods based on physical 
techniques are also well-known to a person skilled in the 
art. E.g. centrifugation has to be mentioned, whereby a 
is solution containing multiple components varying in 
weight, size, and density is deployed in a tube and 
rotated at high angular velocity. An almost preparative 
centrifugation is conducted on some gradient with various 
density from the top to the bottom of the centrifuge 
tube. Two common techniques utilized in connection with 
density gradient separation are sedimentation velocity 
and sedimentation equilibrium or isopycnic 

centrifugation. Furthermore, electrophoretic separation 
methods (Svensson, 1947, Preparative electrophoresis and 
25 ionophoresis, Adv. Protein Chem. 4:251) are routinely 
used and are based on the application of an electrical 
field across and insoluble, porous support medium 
permeated by a buffer solution. Dependent on the net 
charge of the proteins to be separated they will 
experience and electromotive force and migrate toward one 
electrode (cathode or anode) . For the separation of 
proteins, polyacrylamide gels as support medium have 
shown to have almost ideal properties. 



Finally, chromatographic methods are especially well 
suited to separate proteins and to purify the target 
protein for later crystallization steps. Classic ion 
exchange chromatography is simply conducted by packing a 
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vertical hollow glass column with an insoluble resin or 
colloidal matrix that exhibits an array of positively 
charged (anion exchange chromatography) or negatively 
charged chemical groups (cation exchange chromatography) . 
Ion exchange chromatography is based on the fact that a 
positively charged protein will be retarded or bound to 
electrostatic interactions with a. matrix carrying 
negatively charged groups or vice-versa for negatively 
charged proteins- Dependent on their respective net 
charge the proteins to be separated will appear in the 
eluent sequentially with time (or volume). Molecules 
tightly bound to the matrix may be eluted from the column 
by competition with other charged ions. In contrast to 
ion exchange chromatography, molecular sieve 

chromatography (also called gel permeation 

chromatography) separates molecules on the basis of 
molecular weight and shape. Hereby, macromolecules , like 
proteins, - are induced to flow by gravity or pressure 
through a column containing a matrix of microscopic beads 
perforated with a vast network of channels. Thereby, the 
high molecular sieving effect will influence the speed in 
passing from the top to the bottom of the column leading 
to the inverse effect that larger molecules will appear 
first in the column eluent. Finally, absorption 
chromatography, HPLC (high performance liquid 

chromatography), and affinity chromatography are also 
well established as biochemical purification methods. 
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All of the above-mentioned methods exhibit certain 
advantages and disadvantages. Consequently, the person 
skilled in the art will choose the purification method 
which appears to be most appropriate for a given system. 

Since a number of years purification methods take 
advantage of recombinant proteins (Kane and Hartley, 
1988, Development of expression systems for production of 
high levels of protein, Trends Biotechnol . 6: 95). 
Recombinant proteins are produced by recombinant DMA 
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techniques in bacteria, yeast or other organisms such as 
virus infected mammalian or insect cells. The advantage 
of recombinant proteins is based on genetically designed 
element., that aid the biochemist in applying one of "the 

TtlZTT ed PhySiCal ^ bi ~~* Purification 
methods. For example, a series of histidine residues, a 
so-called ..his,tag», may be appended to the carboxyl 
termxnus- , of a recombinant protein. Such a histidin- 
appendxx makes it easier to isolate the expressed protein 
10 on a copper or nickel containing chromatographic resin, 
columns"^ co ™^cially in prepacked 

A second procedure in wide use for the purification of 
is recombinant proteins is the fusion of an expressed 

7 G lT\: ith Sn2yme glUtathi ° ne •ul'ur transferase 

(GST) . Th 1S enzyme has a very high affinity for the small 
peptide glutathione. Following expression of the protein 
an extract of the cells is passed over a small 
20 chromatography column containing a matrix conjugated with 
glutathione. The chimeric protein is then reversibly 
bound on the column through the GST, contaminants are 
washed from the column, and finally the recombinant 
protein is eluted with free glutathione and collected 
2S The GST may then be cleaved from the chimer by a specific 
protease to produce the free recombinant protein. Again, 
the chromatographic matrix may be obtained commercially 
m prepacked columns. 

30 Furthermore, e.g. pMAL™ (by New England Biolabs Inc ) is 
used as protein fusion and purification system as prior 
art. This system comprises the insertion of the cloned 
gene into a pMAL vector downstream from the malE gene 
whach encodes maltose binding protein (MBP) . The fusion 

35 protein (target protein and MBP) is expressed in large 
quantities and purified by affinity chromatography for 
MBP using amylose resin. Finally, MBP is cleaved from the 
target protein by a specific protease. 
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These techniques utilizing recombinant p.otexns allow to 
obtain extraordinarily pure fractions of the target 
protein. However. advantageous conditions for 
ication, crystallization and structural analys s 
have to be tested using, the MBP/target protein or 
OSr ta get protein fusion systems .. for each s.ng 
recombinant and/or target protein, Particu arly <i 
th ere are still complex chromatographic UP vitro) 
p U rif ication steps required "££ZZZ nil 
of the target protein and ^ ^ are 
like crystallization or structure 

complicated by the unknown properties of the target 
■ nv. » o the crystallization conditions of a 
protein, like e.g. the y n 

specific target protein purified as 
protein . 

The object of the present —ion is to ~e £ 
above-mentioned disadvantages of the p 
narticularly to provide a system which allows to reduce 
particularly v offort for purification and 

considerably the time effort for P ^ 
subsequent crystallization as well a 
determination of any protein to be analysed. 

v.- ^ w a recombinant protein 
This object is achieved by a reco 

, i.im i a DNA sequence encoding such a 
arrordina to claim i, a ui»n 

according vector according to claim 

protein according to claim 8, a vector * 

10. a transformed host cell according to daim 12. a 

Method for producing a recombinant protein 

claim 14, and methods for purification 

and structure elucidation according to claims 15, 21 

23 . 

Therefore, it is an object of the present invention in 
one aspec , to provide recombinant proteins which contain 
as component (1, an amino acid sequence of a member of 
the myosin or kinesin protein superf amilies or an ^ amino 
acid sequence of an analog, fragment or derivative of 
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member of the myosin or kinesin protein superf amilies, as 
component (2) any - amino acid sequence of at least 20 
amino acids in length (target protein sequence) , and as 
component (3) a linker region of at least 2 amino acids 
5 between components (1) and (2) (claim 1). However, the 
subject-matter of the present invention works with any 
protein or fragment, derivative or analog thereof as 
component (1) , which binds to a any molecule or structure 
of the cytoskeleton or a cell membrane in a ligand 
io dependent manner. Particularly preferred as component (1) 
are molecules, which exhibit a flexible region, 
particularly at the C- terminal region of component (1) , 
in order to sample for multiple conformations. 

As mentioned above, the present invention further 
is concerns analogs, fragments and derivatives of the 
recombinant protein of the invention. The preparation of 
such analogs, fragments and derivatives is by a standard 
procedure (Sambrook et al . (1989), Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor, New York) in which 
20 in the DNA sequences encoding the inventive recombinant 
protein, one or more codons may be deleted, added or 
substituted by another, to yield analogs having at least 
one amino acid residue change with respect to the native 
recombinant protein, particularly with respect to the 
25 native amino acid sequence of component (1) or of 
component (2) of the recombinant protein of the 
invention. 

Analogs that substantially correspond to the native 
sequence of one or more components of the inventive 

30 recombinant protein are those polypeptides, in which one 
or more amino acids of the native protein's amino acid 
sequence has/have been replaced by another amino acid, 
deleted and/or inserted. In a preferred embodiment of the 
present invention, the resulting components ((1) or (2)) 

35 being incorporated into the recombinant protein of the 
invention exhibit substantially the same or even higher 
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biological activity as the corresponding native protein 
to which it corresponds or exhibit at least structurally- 
similar properties as the native protein to which the 
component corresponds. In order to substantially 
correspond to the native sequence of component (1) or (2) 
of the recombinant protein of the invention, the changes 
in the sequence of the components are generally and 
preferably relatively minor, such as isof orms . Although 
the number of changes may be more than 10, preferably 
there are no more than 10 changes, more preferable no 
more than 5 and most preferably no more than 3 changes in 
component (1) or (2) as compared to the respective native 
sequence. While any technique may be used to find 
potentially biologically active sequences of a component 
of the inventive recombinant protein, which substantially 
correspond to the respective native proteins, one such 
technique is the use of conventional mutagenesis 
techniques on the DNA encoding the protein, resulting in 
a few modifications. The sequences used for component (1) 
or (2) in the recombinant protein of the invention which 
are expressed by such clones, may then be screened for 
their ability e.g. to bind to their native binding 
partners, mediate activity etc. , in other words fulfil 
their biological role. 
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Conservative "changes" are those changes which would not 
be expected to change the activity of the protein and are 
usually the first to be screened as these would not be 
expected to substantially change size, charge or 
structure of the polypeptide sequence used as component 
in the recombinant protein of the invention and thus 
would not be expected to change the biological properties 
of the corresponding native sequence. For example, 
conservative substitutions are assumed, if (a) small 
aliphatic, non-polar or slightly polar residues are 
substituted by other residues belonging to the same 
group, (b) polar negatively charged residues and their 
amides are exchanged for other residues belonging to the 
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same group, (o) polar positively charged residues are 
exchanged for polar positively residues, (d) large 
aliphatic non-polar residues are exchanged for large 
aliphatic non-polar, residues, and (e) finally, aromatic 
residues are substituted by other aromatic residues.- In 
most cases, analogs being .used as component (i) or (2) of 
the recombinant, protein of the invention are defined 
(according to the present invention) as sequences with 
substitutions which do not produce radical changes in the 
characteristics of the corresponding native protein or 
. polypeptide molecule. Characteristics may be the specific 
secondary structure of a sequence, e.g. a-helix or &- 
sheet, as well as its specific biological activity. 

It is noted that apart from sequences being used as 
component (i) or (2) for a recombinant protein according 
to the present invention, which are based on conservative 
substitutions as discussed above, analogs with more 
random changes, which lead to a radical or more radical 
change in biological activity or structure of the analog 
as compared to the native sequence are also within the 
scope of the present invention. 

At the genetic level, these analogs are generally 
prepared by site-directed mutagenesis of nucleotides in 
the DNA encoding the inventive recombinant protein or the 
component of the recombinant protein, respectively, 
thereby producing DNA encoding the analog and thereafter 
synthesizing the DMA and expressing the polypeptide in 
recombinant cell culture. Insofar, it is referred to 
Ausubel et al., Current Protocols in Molecular Biology 
Green Publications and Wiiey Intersigns, New York, New 
York, 1987 - 1995; sambrook et al . , Molecular Cloning- 
Laboratory Manual, Cold Spring Harbor Laboratory, New 
York, 1989, the disclosure of which is incorporated 
herein by reference. Furthermore, site-specific 
mutagenesis allows the production of analogs through the 
use of specific oligonucleotide sequences that encode the 
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enrp of the desired mutation. The technique of 
-DNA sequence of the desi , ified by publications 

site-directed mutagenesis is exemplified by p 
such as Adelman et al., DNA 2 : 183 

disclosure of which is incorporated herein by reference^ 
Epical vectors useful in site-directed mutagenesis 
Typicax , for example as 

include vectors such as ;.M13 -phage, 

include Cleveland Symposium on 

disclosed by Messing et, al (3 . 
Macromolecules and recombinant DNA, editor 
E lsevier, Amsterdam (1981)), the disclosure of which 
0 incorporated herein by reference. 

* s far as derivatives of the native sequence of 
Is of the recombinant protein of the present 
3S « — ed, derivatives may be prepared^ 

£ ^vi^ Qide qroups of one or more 
standard ZLluJ, protein of the 

L5 amino acid residue ; h ; r ^ nts or by conjuga tion of 

the native be 4 Ana loas or fragments, 

~ive ^ t ^- an ^ bo a d ;: 1 rz ym e. receptor, 

to another molecule, e.g. an antioo y, 

20 et c. accordingly, "derivatives., as ^ 

derivatives which may he prepared from the 
gr oups which occur as side chains on the Ie3 ^\ „ t 
the N- or C- terminal groups by means Known in the art^ 
^ have chemical moieties such as 
Derivatives may have c example, 

25 carbohydrates or ^"'J^Z of the carbo.yl 
derivatives ^ include alip-ic este ^ 

groups, amr des of the^ amines , N . a cyl 

— ° r free a^ no groups of the amino acid 

derivatives or free ami 9 derivatives 

30 residues formed with acyl moieties or y 

of free hydroxyl groups derivaCive 

^iTs' ^raily s^:° all POlVPeptide seguences 
mcludes - generally P re combinant protein 

Tar Ty^aTly^r: ST-r may ta.e place 



12-01-2001 



Printed: 14-03-2002 



DISC 



01 100762 



10 



15 



20 



11 

intrasequential-ly -or at the N- or C- terminus of the 
sequence of component (1) and/or (2) of an inventive 
recombinant protein'.- In a preferred embodiment of the 
present invention, additional amino acids are appended to 
the N-terminus of component (l) or the C-terminus of 
component (2) • coinciding with the N-terminus and the C- 
terminus of the inventive recombinant protein. In another 
preferred embodiment, additional amino acid sequences are 
inserted intrasequentially, preferably in such a way that 
the secondary and/or tertiary structure is not destroyed. 
Typically these insertions are placed at the surface of 
the protein, e.g. in 6-bends. Preferably, one or more S- 
containing residues (particularly Cys) are inserted or 
other residues with a potential for binding heavy metal 
atoms (e.g. Hg-ions) . The introduction of additional 
heavy metal binding residues at sites on the surface of 
the recombinant protein of the invention may be 
accompanied by substitution and/or deletion of native 
heavy metal binding residues in order create novel heavy 
metal atom binding sites. Such a procedure is 
particularly suitable for gaining additional phasing 
information for structure determination of large protein 
complexes by X-ray crystallography. 
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In a non-limiting manner, "tag" -sequences may be 
contained in the recombinant protein and, particularly, 
may be added to the N- or C-terminus of the recombinant 
protein of the invention (claim 7). These "tag" -sequences 
typically have antigenic character for commercially 
available antibodies, e.g. an N-terminal "Flag-tag" 
having the sequence DYKDDDDK (one-letter-code) . Other 
suitable "tag" -sequences are, for example, N- or C- 
terminal polyhistidine tags. 



35 



Furthermore, component (l) and/or (2) as parts of the 
recombinant protein of the invention may be fusion 
proteins. Particularly . preferred are sequences fused to 
the N-terminus of the native sequence of component (l) or 
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to the N-terminus of an analog, derivative or fragment 
thereof. For example, component (1) of the recombinant 
protein may be fused N-terminally to a marker protein, 
e.g. an enzyme marker or a fluorescence marker, such as 
GFP (green fluorescence protein), or any sequence being 
suitable as epitope for. an antibody or even to an 
antibody or an antibody fragment, itself.. 

Finally, "fragments" of the native sequence of any 
protein being used as component (1) or (2) of the 
recombinant protein according to the present invention 
may be used, e.g. fragments of proteins of the myosin or 
kinesin protein superf amilies, particularly fragments 
being deleted C-terminally , the deletion comprising at 
least ten, and more preferably at least 50 amino acids. 
However, the fragment of the native sequence may also 
contain deletions at the N- and/or C- terminus and/or 
intrasequentially in component (1) and/or component (2) 
of a recombinant protein of the invention. In a preferred 
embodiment, component (1) consists of a fragment 
comprising the catalytic domain of a member of the myosxn 
or kinesin protein superf amilies of any eukaryotic 
organism. In other words, component (1) corresponds 
preferably to a fragment containing the myosin or kinesin 
motor domain. Within the scope of the present invention 
are therefore recombinant proteins characterized in that 
they contain as component (1) an amino acid sequence for 
the motor domain of a kinesin or myosin family member or 
an analog, fragment or derivative thereof (claim 3). 

in a preferred embodiment of the present invention, the 
recombinant protein according to the present invention 
contains as component (1) an amino acid sequence of a 
member of the myosin I, II, HI, IV, V, VIII, VI, X, or 
XI or a member of kinesin I or II families or an amino 
acid sequence of an analog, fragment or derivative of a 
member of the aforementioned myosin and kinesin families 
(claim 2) . Preferably, component (1) contains a member of 
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l^lcT? " famUy ° f ^ 6Ukary0tic ™- °r an 
preferred, component (1 ) contains myosin II o£ 
Oictyostelium or an analog, fragment or dsiivat °l 

invention ? rChSr PIeferred -*°*~»» <* th. present 
mZr I £0r C r P ° nen C " e Pr ° teins staining the 

motor domams of - smooth . muscle myosin II (e.g. chicken 
gi~ard myosin, , vertebrate or amoeboid fQrms * ^ 

verlhLt, UShb ° rder my ° Sin) ' -yolD 
vertebrate myos ln V , myosin VI, Toxoplasma gondii e g 

(human ki n esin !,, amoaboid or fungal kine 
£>2ctyostelium kinesin 7) . 

Preferably, a recombinant protein according to the 
present invention contains as linker component ,3, a 
stretch f c £ at least 3 amino acids, more preferably 5 and 

pre erred Ts" " ^ Ocularly 

Tz<nr- uhioh contains * — 

-y be used, for exaTpTe X Pr ° t ~" 
... . . example, the cleavage site may contain 

the recognition sequence for factor Xa, thrombin or for 
the protease tbv (recognition sequence, enlyfqg) or the 
Soldati protease. However, it is within the scope of this 
invention that component (1, and ,2, are directly fused 
together without insertion of a linker sequence. 

If linker component (3! consists of three amino acids it 
rs preferred to chose a sequence with at least one G!y 
residue, particularly in the second position of the 
linker stretch (claim 4). still further preferred is a 
linker with the following sequence, N-Leu-Gly-Arg-C. 

Recombinant protein according to any of the preceding 
claims characterised in that it contains as component „f 
(target protein) the sequence of an esterase, hydrolase 
Phosphatase, kinase, protease. channel, structura" 
protein (e.g. coronin, spectrin,, receptor, particularly 
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a neuronal or immunologically relevant receptor Uj. 
f.milv of TNF receptors), transcrrptron factor, 
Protein, lipoprotein oncoprotein or an 
analog, derivative or fragment thereof (clarm 5) . 

„ r^larlv a recombinant protein according to the 
particularly, a amino acid 

present invention has as .component • . (1) J» .« ^ 
seguence as exhibited in . f rgure * 
derivative and/or fragment thereof. It xs P 
combine the seguence of figure ^^^JitM 
(3) containing a protease 6) ? StiU 

— r sir rr^ g — . ~ • 

sequence as shown in fig. 7. 

- ^ nresent invention relates to a DNA 
A second aspect of the present 

-u .nnt-pins a sequence which codes 
sequence which contains a q DrQtein) according 

. , ea _. en(< p (for a recombinant protein; 
amino acid sequence (for particular, the 

to the present invention (claim 8). IP 
pr esent invention- provides , . ^^.^^ fr om 
the group consisting of : - ^ ^ 

the coding region of a r ec es capable of 

the present invention, (b) DNA qu moderate ly 
hybri di Z ation to a sequence ^ ^ ^ _ 
stringent conditions , a nd c) D ^ ^ ^ _ 

degenerate as a result ^ gpecif ic 

sequences defined in (a) and (b) invention is 

embodiment of the above DNA seque c of he^ 
a DNA sequence comprising at least p > 8< 
encoding for a recombinant protein as ^ ^ 
particularly the segment - fig. ^ ^ 

myosi n motor domain^ J- 1 -' invention may be 

a recombinant protein of tte p .^.^ 

detected, obtained and or modified, ^ ^ ^ 

and/or in vivo, by the use chem ical 

amplification techniques, such as ** ^ ^ 

35 oligonucleotide synthesis. PC* ^ ^ 
amplification (increase in number) 
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sequence by repeated " DNA polymerase reactions. This 
reaction may be used as .a replacement for cloning. All 
that is required is a knowledge - of the nucleic acid 
sequence. In order to carry out PCR, primers are designed 
5 which are complementary to the sequence of interest. The 
primers are then generated by automated DNA synthesis. 
Because primers may be defined to hybridize to any part 
of the gene, conditions may be created such that 
mismatches- in the complementary base pairing may be 

io tolerated. Amplification of these mismatch regions may 
lead to the synthesis of a mutagenized product resulting 
in the generation of a polypeptide with new properties 
(site-directed mutagenesis) . Also, by coupling 
complementary DNA (cDNA) synthesis, using reverse 

15 transcriptase, with PCR, RNA may be used as the starting 
material, for the synthesis of a recombinant protein of 
the invention. Furthermore, PCR primers may be designed 
to incorporate new restriction sites or other features 
such as termination codons at the end of the segment to 

20 be amplified. This placement of restriction sites at the 
5 f and 3' ends of the amplified nucleic sequence allows 
for gene sequence including a recombinant protein of the 
invention or a fragment thereof to be custom designed for 
ligation with other sequences and/or cloning sites in 

25 vectors. 

c 

PCR and other methods of amplification of RNA and/or DNA 
are well known in the art and may be used according to 
the present invention without undue experimentation. 
Known methods of DNA and RNA amplification include 

30 polymerase chain reaction and related amplification 
processes (Innes et al . , PCR Protocols: A Guide to Method 
and Amplification) and RNA mediated amplification which 
uses antisense RNA to the target sequence as a template 
for double stranded DNA synthesis (US Patent 5,130,238). 

35 In an analogous fashion, a recombinant protein of the 
invention being composed of components 1, (2) and 3 as 
defined above may be prepared, whereby components 1, (2) 
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and 3 are ligated on a genetic level forming a DNA 
sequence of the invention, which, is used to express a 
recombinant protein of the invention in a suitable host 
system. 

5 Also provided by the .present invention are vectors 
encoding the above recombinant . protein, and. analogs, 
fragments or derivatives of the invention, which contain 
the above DNA sequence of the invention these vectors 
being capable of being expressed in suitable eukaryotic 
10 or prokaryotic host cells (claims 9, 10) . Particularly 
preferred are vectors of the invention, which are capable 
of being expressed in cells of the species Dictyostelium 
(claim 11) . 

In this embodiment (expression vector) of the present 

15 invention the DNA sequence is operably linked to a 
promoter, preferably linked upstream. The promoter will 
preferably be an eukaryotic promoter, particularly a 
constitutive promoter. The. transcription of a DNA 
sequence according to the invention in cells of higher 

20 eukaryotes may be derived from viral genomes. Examples 
would be polyoma viruses, retroviruses, adenoviruses, 
cytomegaloviruses, SV40 and the like. With mammalian 
cells a possibility would be the S-actin promoter. In the 
current invention, the actinl5 promoter is particularly 

25 preferred for expression in DictyostGlium. If 
appropriate, other regulating elements of transcription 
and/or translation will be provided. Particularly 
preferred are cis-acting elements, like enhancer 
sequences, which usually include 10 to 3 00 base pairs and 

30 act upon the promoter to raise the transcription rate. 

They may be arranged in the 3» or 5 1 position of the DNA 
sequence according to the invention, but also in the 
coding sequence itself or in an intron sequence which is 
cut out by splice procedures. Further regulating elements 
35 may serve to regulate transcription termination, so that 
the expression of mRNA is involved. 
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If necessary, the expression vector with the DNA of the 
invention are developed as shuttle vectors, that is, they 
are able to replicate in a host system and can then be 
transfected into another host system for purposes of 
5 expression. For instance, a vector can first be cloned in 
E.coli and then be inoculated into Dictyostelium, yeast 
or any mammalian cell for expression. 

Typically, such expression and cloning vectors include at 
least one selection gene exercising a marker function. 
10 This is a gene allowing host cells to survive or grow 
after being transformed by the vector. Typical selection 
genes code for proteins which permit resistance toward 
antibiotics or other toxins. This, for instance, includes 
puromycin, ampicillin or neomycin. 

15 Host cells, particularly eukaryotic host cells, 
transformed with an expression vector according to the 
invention are another subject of the present invention 
(claim 12) . Appropriate host cells for cloning or 
expressing the DNA sequences are prokaryotic cells, yeast 

20 or higher eukaryotic cells. In a preferred embodiment, 
cells for expressing DNA sequences according to the 
invention are selected from multicellular organisms. This 
also takes place before the background of the function of 
component (1) of the recombinant protein of the invention 

25 to elements of the cytoskeleton (like actin or 
microtubules or even components of the cell membrane or 
membrane of any intracellular organelle, like 
mitochondria) . In principle every eukaryotic cell can be 
used as host cell, even though cells of mammals, like 

30 monkeys, mice, rats, hamster or humans, are preferred. 
Particularly preferred are cells from the species 
Dictyostelium (claim 13) . 

The present invention relates in a further aspect to a 
method for producing a recombinant protein according to 
35 the invention, whereby the method comprises the following 
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steps: (a) preparing a vector according to the invention, 
(b) transforming eukaryotic host cells with a vector 
obtainable from step (a) , and (c) growing transformed 
host cells, of the invention and obtainable from step (b) 
under conditions suitable for the expression of said 
recombinant protein (claim .14) . The expression method of 
the invention allows for pverexpression of any target 
protein or polypeptide of at least 20 amino acid length 
(component (2)) as segment of the recombinant protein of 
the invention. Accordingly, huge amounts of target 
protein as part of a recombinant protein of the invention 
are produced by the method of the invention. It is 
preferred within the scope of the present invention to 
concentrate the overexpressed recombinant protein in the 
cell. This is achieved by constructing recombinant 
proteins of the invention, which do not carry any leader 
sequences for secretion out of the transformed host cell. 



25 



Another aspect of the present invention is a method for 
purifying a recombinant protein according to the 
20 invention or any other recombinant protein, which 
contains an amino acid sequence binding to cytoskeleton 
(actin or microtubules or proteins being bound to actin 
in the cell) or membrane (e.g. inner cell membrane or 
outer or inner membrane of a cell organelle) structures 
and another amino acid sequence (target sequence to be 
analysed) , whereby the method comprises (a) preparing a 
vector according to the invention or a vector encoding 
for any recombinant protein (as disclosed above) , (b) 
transforming eukaryotic host cells with a vector 
3 o obtainable from step (a), (c) growing transformed host 
cells according to the invention and/or obtainable from 
step (b) under conditions suitable for the overexpression 
of said recombinant protein, (d) purifying overexpressed 
recombinant protein by binding to endogenous elements or 
35 structures of the cytoskeleton or membrane, like 
actin/microtubules, of the eukaryotic host cell, and (e) 
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releasing bound recombinant protein from these structures 
or elements, preferably actin or microtubules (claim 15). 

In apreferred embodiment of said method for purifying of 
the invention provides in step (e) , the releasing step, a 
separation from the structures or elements of the cell by 
adding a -substrate, be if a natural or non-natural 
substrate, of component (1) of the recombinant protein 
(claim 16). Whereas in general the natural substrate will 
be used, it may be preferable in certain cases to use a 
non-natural substrate of component (1), like GTP or 
(nucleotide) analogues (where ATP is the natural 
substrate), for releasing purposes. In general, any 
substrate with the potential to release the bound 
recombinant protein, particularly by binding to the 
component (1) of the recombinant protein from the cell 
structure or element is suitable to be used for step (e) . 
It will be appreciated that a method of the invention 
using a member of the kinesin or myosin superfamily or a 
derivative, fragment or analog thereof as component (1) 
is particularly preferred, if it is characterized in the 
addition of ATP, which is the natural substrate for these 
proteins with motility function (claim 17) . 

In a still further preferred embodiment, the purification 
method of the invention comprises an additional step (f ) . 
Step (f) may typically provide at least one additional in 
vitro purification step, whereby all common purification 
procedures available may be provided, for instance all 
procedures described by A. Mc Pherson (Crystallization of 
Biological Macromolecules, Cold Spring Harbor Laboratory 
Press, NY, 1999), which is incorporated herein by 
reference. In a non- limiting manner, the following 
methods, particularly biocxhemical and/or physical 
methods, may be used or combined: salt fractionation, 
desalting, fractionation with organic solvents or with 
other precipitants; selection with heat/pH, 

centrifugation, chromatographic methods, e.g. ion exchange 
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chromatography, . molecular-. sieve chromatography, 
adsorption chromatography, affinity chromatography or 
HPLC ultrafiltration, isoelectric focussing and/or 
electrophoresis by biochemical, particularly 

chromatographic, and/or physical methods (claim 18). 
Affinity chromatography . is • particularly preferred, 
whereby metals (e.g. Ni) and/or antibodies are .typically 
bound to a resin as ligands (claim 19). The affinity 
chromatography may typically be carried out in batch mode 
or by a column packed with an insoluble support matrix. 

A further aspect of the present invention is a 
recombinant protein, particularly in isolated and/or 
purified form, obtainable from a method for producing of 
the recombinant protein of the invention as descrxbed 
herein (claim 20) . 

A still further aspect of the present invention is a 
method for crystallizing a. recombinant protein according 
to the invention characterized in that it comprises (a) a 
purification step according to a method of the invention 
and (b) a crystallization step (claim 21). Hereby, the 
purified recombinant protein obtained in step (a) is 
crystallized by any method known by the skilled person. 
The crystallizing step will be carried under conditxons 
suitable for crystal growth. The conditions may be 
optimized by varying certain parameters, such as stock 
solution, concentration of the recombinant protein 
temperature, pH, ionic strength, precipitating agent 
(e.g. ammonium sulfate or PEG), addition of small amounts 
of organic solvents etc. However, the conditions used for 
crystallization of component (1) alone are preferred 
which means that the conditions suitable for a member of 
the myosin or kinesin superfamily or a fragment, analog 
or derivative thereof may also work to identify crystals 
of the recombinant protein of the invention. In order to 
accelerate the crystallization process, it ib 
particularly preferred to apply a recombinant protexn of 
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the . invention containing as component (1) an amino acid 
sequence with a flexible region, particularly a flexible 
at C- terminal end of component (1) . Thereby, a high 
degree of flexibility of the. components is achieved 
resulting in numerous conformations which can be occupied 
or sampled by the components in the course of the 
crystallization process'.. 



10 
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It is preferred to employ vapor diffusion techniqes 
either by the hanging or the sitting drop method to 
obtain crystals. Furthermore, crystallization may be 
achieved by induction of nucleation. Macro- or 
microseeding methods are mentioned as described by A. Mc 
Pherson (Crystallization of Biological Macromolecules , 
Cold Spring Harbor Laboratory Press, NY, 1999), which is 
- by its whole contents - incorporated as disclosure of 
the present application by reference. 
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Another aspect of the present invention is a protein 
crystal characterized in that the crystal is built by a 
network of recombinant proteins according to the 
invention (claim 22) . This network forms the crystal 
lattice. Within the scope of the present invention are 
crystals of any space group in which identical proteins 
can be arranged. A crystal of the invention may contain 
one, two, three or more recombinant proteins per 
asymmetric unit. At least one heavy atom may be located 
at (a) certain position/s in the recombinant protein 
being arranged symmetrically in the crystal of the 
invention. Crystals may contain ligands non-covalently 
bound to the crystallized recombinant protein as well, 
e.g. ATP, inhibitors, alkali ions or physiological 
ligands, such as hormones, carbohydrates, protein 
fragments etc. 



Finally, an aspect of the present invention is a method 
for elucidating the atomic structure of a protein crystal 
35 of the invention, whereby, after a crystallization step 
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(a) according to the invention, X-ray diffraction data 
are collected on a beamline or any kind of device 
suitable for measuring locations of X-ray reflections 
(diffractometer, (b) ) . In final step (c) , the atomic 
5 structure or rather the electron density map (into which 
the polypeptide chain and, eventually, other ligands and 
water molecules are modeled) . of . a . recombinant protein is 
calculated by Fourier transformation of . the . data set 
obtained in step (b) using phasing information obtained 
10 by anomalous scattering, the heavy atom method or 
molecular replacement techniques (claim 23), as e.g. 
described by Stout & Jensen (X-ray Structure 
Determination, Wiley, NY, 1989), which is incorporated 
herein by reference. For the present invention molecular 
15 replacement methods are particularly useful. The phasing 
information may be obtained from component (D as 
starting model, which is typically a structurally well 
determined polypeptide. Therefore, component (1) is a 
"helper" sequence providing the starting information to 
solve the structure of the recombinant protein or the 
structure of component (2) , respectively, which is the 
target protein to be structurally analysed. Further 
rounds of structure refinement by methods known by the 
skilled person or described by Stout & Jensen may serve 
to improve the structure model. Additionally, heavy atoms 
may be bound to known sites of component (1) of the 
recombinant protein of the invention. Thereby, additional 
phasing information may be obtained for structure 
elucidation of target component (2) (which is under 
analysis) of the recombinant protein of the invention. 
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The use of a recombinant protein of the invention for 
purification and crystallization purposes has 
unprecedented advantages over the methods known in the 
art. The recombinant protein via its component (1) binds 
to insoluble components of the cell, like the 
cytoskeleton, membrane components or the like. Following 
cell lysis, the recombinant fusion protein (or rather its 
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component (2), which- is wanted - to be purified, analysed 
or subjected to X-ray analysis) can be enriched by ligand 
depletion and precipitation with the insoluble 
interaction partners of the cell. This allows for a 
purification step already carried out in the cell without 
any additional. Therefore, it is not the lysate as a 
whole which contains • the overexpressed protein but the 
pre-purified precipitate itself. The specific 
solubilization of the fusion protein is achieved by 
addition of the ligand to the insoluble fraction. For 
crystallization, the conditions (parameters) are 
preferably chosen such that they coincide with the 
conditions for structurally well characterized component 
(1) . These conditions or subtle variations of these 
conditions are expected to work for the recombinant 
protein as well. Hence, the method of the present 
invention for crystallizing allows to • find 
crystallization conditions without extensive search for 
suitable parameters required by the art. 

However, it is within the scope of the present invention 
that a recombinant protein of the invention or any other 
recombinant protein which is purified according to a 
method of the present invention may be structurally 
analysed by any other method known by the skilled person. 
Particularly, such recombinant proteins may be subjected 
to NMR analysis (two-dimensional or multidimensional) as 
described by Roberts (1993, NMR of Macromolecules : A 
practical approach, Oxford-New York) , which is 
incorporated herein by reference. Furthermore, the system 
of the present invention may be used for drug design 
(ligand to component (2) of the recombinant protein used) 
as described by Craik (1996, NMR in drug design, CRC 
Press, Boca Raton), which is incorporated by reference. 
Other methods of structure eclucidation are, for 
instance, mass spectrotometry as described by Siuzdak 
(1996, Mass Spectrometry for Biotechnology, Academic 
Press, San Diego), incorporated herein by reference. 
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Another object of the present invention is the following 
subject-matter, namely a method for isolating and 
identifying proteins which are capable of binding to the 
target sequence used as component (2) .in the recombinant 
protein (particularly of the invention) . Therefore, a 
yeast-two-hybrid system may be used, by which a sequence 
encoding the recombinant protein is carried by one hybrid 
vector and sequence carried by the second hybrid vector, 
the vectors being used to transform yeast host cells and 
the positive transformed cells being isolated, followed 
by extraction of the said second hybrid vector to obtain 
a sequence encoding a protein which binds to said 
recombinant protein directly or indirectly via other 
proteins. 

Furthermore, according to the. present invention, an 
approach/method suitable for the identification of 
j binding partners to the recombinant protein of the 
invention may comprise the following steps: (a) a library 
of cDNA is typically fused to the C-terminus of component 
(1), particularly of a myosin motor domain (MMD) , 
(typically resulting in a recombinant protein of the 
5 invention), eventually via a linker sequence; (b) this 
recombinant protein is expressed in Dictyosteiium or 
another eukaryotic system; (c) clonal transformants are 
probed with the bait-protein of choice fused to any 
marker protein, e.g. fl-galactosidase; and (d) after 
J0 washing identification and determination of interacting 
recombinant protein by measuring the activity of bait- 
marker fusion protein, e.g. by addition of fl-gal . In a 
preferred embodiment, the myosin motor domain may be his 
or epitope-tagged at the N-terminus. Typically all steps 
35 of the method of the invention are carried out in 
microtiter well plates. 
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Preferably, the recombinant protein shown to have bound 
to the bait-protein of choice may be purified by the 
methods of the invention and can then be subjected to 
5 further biochemical or structural characterization, e.g. 
crystallization as described above, with or without 
cleavage by a protease, ' if a recognition in a linker 
region has been provided, in order to release component 
(2) , •■ the- target ' protein. The method of the invention is 
10 suitable for the identification of unknown binding 
partners and may also be used to demonstrate the 
interaction between two known polypeptides. 

The method of isolating yet unknown binding partners of 
15 the invention has numerous advantages over the method 
known in the art. E.g. MMD-fusion proteins may be easily 
purified from Dictyostelium and the MMD fusion system may 
be transferred to a wide range of high eukaryotic cells. 
Furthermore, (i) the MMD-cDNA constructs may be directly 
20 used for expression in Dictyostelium and other eukaryotic 
cells; (ii) decreased background (since the system works 
with purified proteins and not with proteins within a 
cellular environment that, as in the case of the yeast 2- 
hybrid-system, leads to a high background of false 
25 positive clones); (iii) easy identification and isolation 
of the positive construct from mother plates; and (iv) 
the procedure may be highly automated, since all steps in 
the interaction screening may be performed in microtiter 
well plates. 

30 



Description of figures 

35 In figure 1 the structure of M761-2R-R238E, an example for 
a recombinant protein of the invention, is shown. Although 
two molecules are present in the crystallographic 
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asymmetric unit, only one is shown here. The two molecules 
are essentially identical throughout the myosin motor 
domain (residues 2-7 61) exemplifying component (1) of the 
recombinant protein of the invention. However, upon 
5 leaving the converter domain, the lever arms assume 
slightly different orientations and- deviate at the ends by 
19.4 A. In fig. 1(A) a complete molecule (recombinant 
protein of the invention) spanning amino .acids 2-1010 is 
shown. No electron density was observed for five residues 
io at the N-terminus, the loop region 205-208, and one 
residue at the C-terminus. The N-terminal domain (2-200) 
is shown in green, 50 kDa domain in red (201-613), C- 
terminal and converter domain in blue (614-761), linker 
region in orange (762-764) (component (3)) of the 
15 recombinant protein of the invention), a-actinin lever arm 
in yellow (765-1003) (component (2) of the recombinant 
protein of the invention) and seven histidines from the 
His 8 purification tag in gray (1004-1010), which are 
linked as specified for an preferred embodiment of the 
20 present invention. The linker region (component (3)) is 
composed of three residues (Leu-Gly-Arg) introduced during 
cloning. The observed lever arm is -140 A long (measured 
from Ca of 761 to. Ca of 1010) . Each a-actinin repeat 
contributes -65 A, and the histidine purification tag 
25 another 10 A. Helices 1-3 make up the first a-actinin 
repeat, and 4-6 the second. The arrowhead indicates the 
a-helical region linking the two repeats. The disruptive 
kink in helix 2 is caused by the presence of two adjacent 
proline residues (see fig. 5A) . In fig. KB) a detailed 
30 view of the linker region joining the myosin converter 
domain to helix 1 of a-actinin is depicted. The view is 
rotated 180° around a vertical axis from that in fig. 
1 (A) . 
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In figure 2 a detailed view of the conserved salt bridge 
linking switch - 1 and switch II is shown as a result of 
purifying, crystallizing a recombinant protein of the 
invention and finally solving the structure of that 
protein according to., methods of the .invention. The 
conserved nucleotide binding/sensing elements found in 
all myosins, kinesins, and G-proteins are highlighted for 
the P-loop in blue, switch I in green, and switch II in 
red. Fig. 2(A) shows the structure of Dictyostelium 
myosin II motor complexed with Mg-ADP-BeF 3 . As in Mg-ADP- 
VO< (Smith and Rayment, 1996a) and Mg-ADP-BeF 3 (Dominguez 
et al., 1998) structures, switch I and switch II are 
closed. The conserved salt bridge between residues R238 
and E459 is shown as a ball-and-stick model surrounded by 
2.6 A experimental 2f G -f c electron density (blue wire- 
frame), contoured, at 1 o. As expected for a salt bridge, 
the electron density is continuous between the residues, 
which point toward each other. In fig. 2(B) the same 
region as observed in the crystal structure of M761-2R- 
R238E is shown. The electron density was calculated from 
a model with alanins at positions 238 and 459 in order to 
eliminate model bias. Electron density for two glutamic 
acid residues is clearly visible, but the side chain of 
E238 now points away from E459 and the switch II loop has 
moved away from switch I. m fig. 2(C) the same region 
showing a superposition of the M761-2R-R238E structure 
with a structure of Dictyostelium, myosin II motor 
complexed with Mg-ADP-V0 4 (PDB code 1VOM) (Smith and 
Rayment, 1996a) . The nucleotide and R238-E459 salt bridge 
are shown as ball-and-stick models. Both the P-loop and 
switch I regions are in essentially identical 
conformations in both structures. However, the switch II 
region (red) shifts to the right, toward the nucleotide, 
by ~5 A in the Mg-ADP-V0 4 structure, allowing the 
formation of the R238-E459 salt bridge. 
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Figure 3 shows the orientation of the myosin lever arm, a 
segment of component (1) of an example for an recombinant 
protein of the invention.' Shown in yellow are five 
molecules of actin making up part of a helical actin 
filament. Modeled on to- this are myosin in the "pre- 
power-stroke" up/closed orientation in red, the "post- 
power-stroke" down/open orientation in blue, and the 
M7 61-2R-R238E structure in green. The up, down, and 
actomyosin complex structures were modeled. The M7 61-2R- 
R238E structure was then aligned to the core domain of 
the down/open structure via residues 160-200, which 
includes the highly conserved P-loop region. It is noted 
that in the M761-2R-R238E structure, the helix leaving 
the converter domain initially superposes with the 
down/open structure, but then deviates due to the 
different helical bend of the a-actinin. 
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In figure 4 the structure of a-actinin repeats 1 and 2 
are shown. a-Actinin is an example for component (2) of 
the recombinant protein of the invention, which means ex- 
acting is the target protein in this example. Its 
strucure was solved using purification and 
crystallization methods of the present invention. The 
structure of the fig. 4(A) depicts in yellow an a-carbon 
chain trace of the 6 helices making up repeats 1 (helices 
labeled 1-3) and 2 (helices labeled 4-6). The 17 
hydrophobic aromatic amino acid residues stabilizing the 
triple-helical packing are shown in green (7 tyrosines, 6 
phenylalanines and 4 tryptophans). Shown in red are two 
adjacent proline residues, which cause a kink, but not a 
break in a-helix 2 of repeat 1. The uninterrupted a-helix 
linking repeats 1 and 2 is shown in orange. Fig. 4(B) 
exemplifies a detailed view of the linker region, 
highlighting the stabilizing hydrophobic and hydrogen 
bonding interactions. Colours and orientation are 
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identical to those in fig. 4(A). Side chains are shown as 
ball-and-stick models, with the exception of Asp796 and 
Ser797, in which only the a-carbon atoms involved in 
hydrophobic contacts are shown for clarity. The salt 
bridge between Arg880 and Glu877, and the hydrogen bond 
between Arg880 and the carbonyl oxygen of Leu956 (also 
shown as a ball-and-stick model), are shown as dashed 
lines . 



10 Figure 5 is a comparison of Dictyostelium a-actinin with 
human a-actinin and human a-spectrin. Fig. 5(A) shows the 
overlapping repeat 2 region of Dictyostelium (yellow) and 
human (blue) a-actinin are shown as ribbon diagrams. 
Helices are numbered as described above for Dictyostelium 

15 a-actinin and, in parentheses, as described previously 
for human a-actinin (Dj inovic-Carugo et a J . , 1999). The 
largest differences occur in the loop region connecting 
helices 4 and 5, indicated by an arrow, where the human 
a-actinin structure would seriously overlap with 

20 Dictyostelium helix 6. In fig. 5(B) the alignment of 
Dictyostelium repeat 2 (yellow) with repeat 16 human 
a-spectrin (green) is shown as ribbon diagrams. Helices 
are numbered as described above for the Dictyostelium 
protein and, in parentheses, as described previously for 

25 the human protein (Grum et a 1 . , 1999). Dictyostelium 
helix 4 and a-spectrin helix A, which are in the 
background, are colored white for clarity. In general, 
the two structures align more closely than, the human/ 
Dictyostelium alignment described above. The largest 

30 difference occurs in the loop region connecting helices 5 
and 6, indicated by an arrow, where the human a-spectrin 
structure is moved in respect to the Dictyostelium 
a-actinin structure as a result of a proline-induced kink 
in helix B. 
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Fig. 6 shows the amino acid sequence (one-letter-code) 
for component (1) of recombinant protein M761-2R R238E 
exemplifying a recombinant protein of the invention. 
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In fig. 7 the whole sequence of recombinant protein M7 61- 
2R R238E is depited comprising as. component (1) the ammo 
acid sequence of the myosin II motor domain of 
Dictyostelium, a three amino acid linker region ( ) as 
component (3) and the a-actinin amino acid sequence being 
the target sequence in this example {one-letter-code) . 
Fig. 8 is the DNA sequence coding for recombinant protein 
M761-2R R238E such that the sequence of fig. 8 
corresponds to the sequence of fig. 7 on the genetic 
level . 
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Examples of the present invention 



Example 1 



20 



25 



30 



35 



(a) Expression 

The expression-vector P DXA-3H, that was used for the 
production of M761-2R R238E, carries the origin of 
replication of the Dictyostelium high copy number plasmid 
Ddp2 (Leiting et al. 1990, Molecular And Cellular Biology 
10, 3727-3736; Chang et al. 1990, Nucleic Acids Research 
17, 3655-3661), an expression cassette consisting of the 
strong, constitutive actinlb promoter, a translational 
start codon upstream from a multiple cloning site (MCS) , 
and sequences for the addition of a histidine octamer at 
the carboxy terminus of any protein. Plasmids derived 
from pDXA-3H were transformed into orf -cells. These cells 
carry several integrated copies of the rep gene which is 
essential in trans for the replication of plasmids that 
carry the Ddp2 origin (Leiting et al . 1990, Molecular And 
Cellular Biology 10, 3727-3736; Slade et al. 1990, 
Plasmid 24, 195-207) . The myosin-a-actinin fusion was 
created by linking codon 761 of the Dictyostelium mhcA 
gene to codon 264 of the Dictyostelium a-actinin gene. 
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The resulting construct pDH12-2R extended to codon 505 of 
the ot-actinin gene. Plasmid pDH20 was generated by 
insertion of the first 765 codons of Dictyostelium myosin 
II into the MCS of pDXA-3H (Furch et al. 1998, 
Biochemistry, 37, 6317-6326). Site directed mutagenesis 
was used to generate plasmid pDH20(R238E) encoding a 
motor domain fragment with the single point mutation 
R238E. Replacement of the 2 kb Safl-BstXI fragment of 
PDH12-2R with the corresponding fragment from 
P DH20(R23BE) was used to generate the expression vector 
for the production of M761-2R R238E, the fusion protein 
and an example for a recombinant protein of the 
invention, containing thus both a point mutation in the 
active site and a C-terminal extension consisting of two 
ct-actinin repeats. 



(b) Purification 

The overexpressed protein was purified by Ni 2+ -chelate 
affinity chromatography as described by Manstein and Hunt 
20 (J. Muscle Res. Cell Motil . , 1995, 16, 325) und Manstein 
et al. (Gene 1995, 162, 129). Both afore-cited documents 
are incorporated into the disclosure of the present 
invention by their complete contents. 

25 Cells expressing the histidine octamer tagged fusion 
protein were grown in 5 1 flasks containing 2.5 1 DD- 
Broth 20. DD-Broth 20 contains (per litre): 20 g protease 
peptone (Oxoid) , 7 g yeast extract (Oxoid) , 8 g glucose, 
0.33 g Na 2 HP0 4 .7H 2 0, and 0.35g KH 2 P0 4 . The flasks were 

30 incubated on a gyratory shaker at 200 rpm and 21°C. Cells 
were harvested at a density of 6 x 10 6 ml -1 by 
centrifugation for 7 min at 2,700 rpm in a Beckman J-6 
centrifuge and washed once in phosphate buffered saline. 
The wet weight of the resulting cell pellet was 

35 determined. Typically, 35 g were obtained from a 15 1 
shaking culture. The cells were resuspended in 140 ml of 
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Lysis Buffer (50 mM Tris.HCl, pH- 8.0, 2 mM EDTA, 0.2 mM 
EGTA, 1 mM dithiothreitol (DTT) , 5 mM benzamidine, 40 
mg/ml.TLCK, 20 mg/ml N-tosyl-L-phenylalanine chloromethyl 
ketone (TPCK), 200 mM phenylmethylsulfonyl fluoride 
(PMSF) , 0.04 % NaN 3) . 

Cell lysis was induced by the addition of 70 ml of Lysis 
Buffer containing 1 % Triton-XlOO, 15 mg/ml RNaseA 
(Sigma) and 100 units of alkaline phosphatase . . The lysate 
was incubated on ice for one hour. Upon centrifugation 
(230,000g, 1 hour), the recombinant protein remained in 
the pellet. The pellet was washed in 100 ml of HKM buffer 
(50 mM HEPES, pH 7.3, 30 mM KAc, 10 mM MgSO«, 7 mM b- 
mercaptoethenol, 5 mM benzamidine, 40 mg/ml PMSF) and 
centrifuged for 45 min at 230,000g. The recombinant 
protein was released into the supernatant by extraction 
of the resulting pellet with 60 ml HKM buffer containing 
10 mM ATP. After centrifugation (500,000g, 45 min.), the 
supernatant was loaded using a peristaltic pump onto a 
Ni 2+ -nitrilotriacetic acid (Ni 2 *-NTA) affinity column (1.5 
x 10 cm) (Qiagen). The flow-rate was adjusted to 
approximately 3 ml min" 1 . After loading was completed the 
column was connected to a Waters 650M chromatography 
system. The column was washed briefly in Low Salt buffer 
(50 mM HEPES, pH 7.3, 30 mM KAc, 3 mM benzamidine), High 
Salt buffer (as Low Salt Buffer, but with 300 mM KAc), 
and Low Salt Buffer containing 50 mM imidazole, the 
recombinant myosin was eluted using a linear gradient of 
Low Salt Buffer and Imidazole Buffer (0.5 M imidazole, pH 
7.3, 3 mM benzamidine), starting with 10 % Imidazole 
Buffer and reaching 100 % after 15 minutes. The flow rate 
was 3 ml min" 1 and 3 ml fractions were collected. 
Absorbance at 280 nm was monitored. SDS gels were run to 
check the purity of the eluted protein. 
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The pooled fractions were dialysed immediately against 
Storage Buffer (20 mM HEPES, 0.5 mM EDTA, 1 mM DTT, pH 
7.0) containing 3% sucrose and the purified protein could 
be stored at -80°C for several months without apparent 
5 loss of enzymatic activity. Actin-activated ATPase 
activity was measured by the release of inorganic 
phosphate . 

(c) Crystallization 

10 Crystals of the overexpressed and purified recombinant 
protein M7 61-2R R238E were grown by the hanging drop 
method at 7°C. The drops contained equal volumes (2.2 /zl) 
of the protein solution and the mother liquor. The mother 
liquor contained 12% PEGM 5K. 170 mM NaCl, 50 mM HEPES- 

15 NaOH pH 7.2, 5 mM MgCl 2 , 5 mM DTT, 0.5 mM EGTA and 2% 2- 
methyl-1, 3 -propanediol . The protein solution (5 mg/ml) 
contained additionally 200 ADP and 2 00 /xM vanadate, 

and was incubated on ice for 1 h before setting up the 
drops. Crystals normally appeared after 7-8 days and 

20 reached maximum dimensions of 0.1 x 0.3 x 0.9 mm. 
Crystals were transferred to a solution of mother liquor 
plus 3 0% glycerol and frozen in liquid nitrogen for 
storage and data collection. 

25 (d) Crystallography and structure refinement 

Diffraction data for the crystals of the recombinant 
protein M761-2R R238E were collected at ESRF beamline ID- 
13 on a MarCCD detector and integrated and scaled using 
the program XDS (Kabsch, 1993, J. Appl . Cryst . , 26, 795), 

30 producing a data set 97.7% complete to 2.8 A with 4 -fold 
redundancy and an R sym of 11.0%. The M761-2R-R238E 
crystals belonged to space group P2x2i2 with two molecules 
in the asymmetric unit. Molecular replacement was 
performed with the program AMoRe (Navaza, 1994, Acta 

35 Cryst. A 50, 157) using the crystal structure of 
Dictyostelium myosin resides 2-759 complexed with Mg-ADP- 
BeF x (PDB code lmmd) (Fisher et al., 1995, Biochemistry, 
34, 8960) as a starting model (the nucleotide and the 



33 



12-01-2001 



Printed: 14-03-2002 



DESC 



0110076^ 



34 

side chains beyond Gfi of residues* 238 and 459 were 
excluded) . 



10 
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Initial maps showed clear helical density for the first 
repeat of the a-actinin lever arm, which was built as a 
poly-alanine model using the program 0 (7.0 for 
WindowsNT) . Following several- rounds of simulated 
annealing refinement using torsional dynamics and a 
maximum likelihood target with the program CNS vO - 9a 
(Brunger et al . , 1998, Acta Cryst . D, 54, 905), the 
second a-actinin repeat was visible and built. Subsequent 
rounds of model building and refinement (including bulk 
solvent correction) produced the final structure of two 
M761-2R-R238E molecules containing 1005 residues each, 
two molecules of Mg-ADP and 14 water molecules (R- factor, 
24.1%; Rfree/ 29,9%). Ramachandran analysis shows all non- 
glycine residues to be in allowed regions. Figures were 
made using the programs Bobscript (Esnouf , . 1997, J. Mol . 
Graph. Model., 15, 132) and Raster3D (Merritt and Bacon, 
1997, Methods Enzymol . , 277, 505). 
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Example 2 

Myosin-Fusion-System for isolating 

proteins/protein binding partners 



interacting 
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(a) Preparation 

In order to demonstrate the function of the myosin- 
fusion-system a library of cDNA was fused to the C- 
terminus of a myosin motor domain (MMD) and expressed in 
Dictyostelium or another eukaryotic system. Clonal 
transformants were probed with the bait-protein of choice 
fused to fi-galactosidase. The myosin motor domain was 
His- or epitope-tagged at the N-terminus. 

Experimentally, cells were transformed with the MMD-cDNA 
library and clones were grown and kept in 96 wells 
plates. The bait-B-gal fusion protein was transformed in 
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Dictyostelium Orf+. cells and, grown in an appropriate 
quantity (1 clonal cell line) . Upon reaching confluence, 
the MMD-cDNA clones in the 96 wells plates were washed 
once in the plates" with PBS and then lysed by adding 
5 lysis buffer containing Triton X-100 (or, alternatively, 
NP-40) , at the same time the ATP pool was depleted by the 
addition of " alkaline ' phosphatase. The actin-based 
cytoskeleton with all myosin and also the M765-fusion- 
proteins were pelleted by centrifugation and washed with 

10 lysis buffer. The myosin was released from the pellets by 
the addition of Mg 2+ -ATP. The ATP-unsoluble fraction was 
pelleted and the supernatant transferred to 96 wells 
plates coated with Ni-NTA. The His-tagged products of the 
MMD-cDNA were shown to bind to these plates. After 

is extensive washing the coated plates were incubated with 
the bait-fi-gal construct. Again, after extensive washing 
the plates were incubated with a substrate for ft-gal, in 
this case CPRG (red color OD574) or ONPG (yellow OD 4 i 5 ) , and 
the fi-gal activity was determined with a microtiter plate 

20 reader. High fi-gal activity indicated a strong 
interaction, between the bait and the product of the 
target cDNA. 

The selected clones were then recovered from the original 
25 96 wells plates. The MMD-cDNA- clone was expressed in and 
purified from Dictyostelium by standard myosin motor 
domain purification. For further biochemical and 
structural characterization, the isolated gene product 
was either cleaved with an appropriate protease to 
30 release it from the MMD or was used directly in the 
fusion form for kinetics or crystallization experiments. 

(b) Interaction Test 

The method of the invention was tested by expressing MMD- 
35 RaclA and DRG-2D-6-gal (the DRG-2D construct acts as an 
exchange factor for the small G-protein RaclA) . 
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The MMD-RaclA cells were cloned, grown in 96 wells 
plates, washed, lysed and ATP extracted as described 
above. The Ni-NTA coated plates were then incubated with 
s the ATP-released protein fraction. The cells expressing 
the DRG-2D-fi-gal were grown in shaking suspension and 
washed and lysed under the same conditions. The DRG-2D-B- 
gal supernatant was incubated at different dilutions. As 
controls wells were incubated without bait (DRG-2D-IJ-gal) 
io or without MMD-RaclA or with MMD alone. All controls were 
negative after staining for B-gal, whereas the. 
incubations with immobilized MMD-RaclA and the bait gave 
a signal, which was dependent on the concentration of 
added bait. 

is 

In conclusion, the interaction between DRG-2D and RaclA 
was shown by the method of the invention, whereas it 
could not be shown when using the yeast two-hybrid 
system. Therefore, the method of the invention has 
20 definitely advantages over the yeast-two-hybrid system or 
other techniques developed to identify protein-protein 
interactions and known in the art. 
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Recombinant protein characterized in that it 
contains as component (1) an amino acid sequence of 
a member of the myosin or kinesin protein 
superfamilies or an amino acid sequence of an 
analog, fragment or derivative of a member of the 
myosin or kinesin protein superfamilies, as 
component (2) any amino acid sequence of at least 20 
amino acids in length, and as component (3) a linker 
region of at least 2 amino acids between components 
(1) and (2) . 



is 



20 



Recombinant protein according to claim 1 
characterized in that it contains as component (1) 
an amino acid sequence of a member of the myosin I, 
II, III, iv, v, vili, vi, x, or XI or a member of 
kinesin I or II families or an amino acid sequence 
of an analog, fragment or derivative of a member of 
the aforementioned myosin and kinesin families. 

Recombinant protein according to claim 1 or 2 
characterized in that it contains as component (1) 
an amino acid sequence for the motor domain of a 
kinesin or myosin family member or an analog, 
fragment or derivative thereof. 



0 
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Recombinant protein according to any of the 
preceding claims characterized in that it contains 
as component (3) a stretch of 3 amino acids with a 
Gly in the middle. 

Recombinant protein according to any of the 
preceding claims characterized in that it contains 
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as component <2> the . sequence of an esterase, 
hydrolase, phosphatase, kinase, protease, channel, 
structural protein (e.g.; coronin, spectrin, 
rTePtor. a neuronal or "loyally 

relevant receptor «.. 9 . ^iZJtLZ, 
receptors). transcription factor. 

protein, lipoprotein or. glycoprotein or . an anlog, 
derivative or fragment thereof. 

Recombinant protein according to any of the 

i i-o «5 characterized in that it 
nrecedinq claims 1 to b cnai*^ 

contains 9 as component », an amino acid seouence a 
shown in figure 6 and as component (3) the linker 
amino acid sequence Leu-Gly- Arg . 

Recombinant protein according - any of preceding 
claims 1 to 6 characterized in that it contains , a 
tag sequence, particularly at the or C-terminus 
of the recombinant protein. 

DNA sequence characterized in that ' it contains j 
S equence which codes for an amino acid sequence 
- according to any of the preceding claims 1 to . 

9 . vector containing a DNA sequence according to claim 
8 . 

claim 9 capable of being 

10. Vector according to claim * r 

expressed in a eukaryotic host cell. 

xl vector according to claim 9 or 10 capable of being 
" expressed in cells of the species Dictyostelium. 

12 Transformed euxaryotic host cell containing a vector 
according to any of preceding claims 9 to 11. 
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13 . Transformed eukaryotic host cell according to claim 
12 characterized in that it is a cell from the 
species Dictyostelium. 



10 



14 . Method for producing a recombinant protein according 
to any of the preceding claims 1 to 7 characterized 
in that it comprises (a) preparing a vector 
according to any of preceding claims 9 to 11 , (b) 
transforming eukaryotic host cells with a vector 
obtainable from step (a) , (c) growing transformed 
host cells . according to claim 12 or 13 and 
obtainable from step (b) under conditions suitable 
for the expression of said recombinant protein. 



15. Method for purifying a recombinant protein according 
to any of preceding claims 1 to 7 characterized in 

15 that it comprises (a) preparing a vector according to 

any of preceding claims 9 to 11, (b) transforming 
eukaryotic host cells with a vector obtainable from 
step (a) , (c) growing transformed host cells 
according to claim 12 or 13 and obtainable from step 

20 (b) under conditions suitable for the overexpression 

of said recombinant protein, (d) purifying 
overexpressed recombinant protein by binding to 
endogenous actin/microtubules (mt) of the eukaryotic 
host cell, and (e) specifically releasing bound 

25 recombinant protein from actin/mt. 

16. Method according to claim 15 characterized in that 
step (e) comprises releasing the recombinant protein 
by adding a natural substrate of component (1) of 
the recombinant protein. 



30 17. Method according to claim 16 characterized in that 
the natural substrate is ATP. 



18. Method according to any of preceding claims 15 to 17 
characterized in that it comprises as further step 
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(f) at least one additional purification step by 
biochemical, particularly chromatographic, and/or 
physical methods. 

19. Method according to claim 18 characterized in that 
step (f ) comprises purifying by affinity 
chromatography, particularly using metals or 
antibodies as ligands. 
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Recombinant protein obtainable from a 
according to any of preceding claims 15 to 19. 



method 
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Method for crystallizing a recombinant protein 
according to any of preceding claims 1 to 7 
characterized in that it comprises (a) a 
purification step according to a method according to 
any of preceding claims 15 to 19 and (b) 
crystallizing the purified recombinant protein 
obtained in step (a) . 

Protein crystal characterized in that the crystal is 
built by a network of recombinant proteins according 
to any of preceding claims 1 to 7 that form the 
crystal lattice. 

Method for elucidating the atomic structure of a 
protein crystal according to claim 22 characterized 
in that it comprises (a) a crystallization step 
according to claim 21, (b) collecting X-ray 
diffraction data for a crystal obtained in step (a) , 
and (c) calculating the atomic structure of a 
recombinant protein by transformation of the data 
obtained in step (b) . 
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Fig. 1, Klichec/a/. 
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Fig. 4, Kliche et al. 
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MDGTEDPIHDRTSDYHKYLKVKQGDSDLFKLTVSDKRYIWYNPDPKERDSYECGEIVSE 
TSDSFTFKTVDGQDRQVKKDDANQRNPIKFDGVEDMSELSYLNEPAVFHNLRVRYNQDL 
I YTYSGLFLVAVNPFKRI PI YTQEMVDI FKGRRRNEVAPHI FAI SDVAYRSMLDDRQNQ 
S LLI TGE SGAGKTENTKKVI QYLAS VAGRNQANGSGVLEQQ I LQANP I LEAFGNAKTTR 
NNNS SRFGKF I E I QFNSAGF I SGAS I QS YLLEKSRWFQSETERNYH I F YQLLAGATAE 
EKKALHLAGPESFNYLNQSGCVDIKGVSDSEEFKITRQAMDIVGFSQEEQMSIFKIIAG 
ILHLGNIKFEKGAGEGAVLKDKTALNAASTVFGVNPSVLEKALMEPRILAGRDLVAQHL 
NVEKSSS SRDALVKALYGRLFLWLVKKINNVLCQERKAYFI GVLDI SGFE IFKVNSFEQ 
LC INYTNEKLQQFFNHHMFKLEQEE YLKEKINWTF I DFGLDSQAT I DL I DGRQP PGI LA 
LLDEQSVFPNATDNTLITKLHSHFSKKNAKYEEPRFSKTEFGVTHYAGQVMYEIQDWLE 
KNKDPLQQDLELCFKDS SDNWTKLFNDPN I ASRAKKGANF I TVAAQ YKEQLAS LMATL 
ETTNPHFVRCI I PNNKQL P AKLiEDKWLDQLRCNGVLEG I R I TRKG F PNR 1 1 YADFVKR 
YYLLAPNVPRDAEDSQKATDAVLKHLNIDPEQYRFGITKIFFRAGQLARIEEAREQR 
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MDG^fiDPIHDRTSDYHKYLKVKQGDSDLFKLTVSDKRYIWYNPDPKERDSYECGEIVSE 
TSDSFTFKTVDGQDRQVKKDDANQRNPIKFDGVEDMSELSYLNEPAVFHNLRVRYNQDL 
I YTYSGLFLVAVNPFKRI PI YTQEMVDIFKGRRRNEVAPHI FAI SDVAYRSMLDDRQNQ 

SLLITGESGAGKTENTKKVIQYLASVAGRNQANGSGVLEQQILQANPILEAFGNAKTTR 

NNNSSRFGKEIEIQENSAGFISGASIQSYLLEKSRWFQSETERNYHIFYQLLAGATAE 

EKKALHLAGPESFNYLNQSGCVDIKGVSDSEEFKITRQAMDIVGFSQEEQMSIFKIIAG 

ILHLGNIKFEKGAGEGAVLKDKTALNAASTVFGVNPSVLEKALMEPRILAGRDLVAQHL 

NVEKSSSSRDALVKALYGRLFLWLVKKINNVLCQERKAYFIGVLDISGFEIFKVNSFEQ 

LCINYTNEKLQQFFNHHMFKLEQEEYLKEKINWTFIDFGLDSQATIDLIDGRQPPGILA 

LLDEQSVFPNATDNTLITKLHSHFSKKNAKYEEPRFSKTEFGVTHYAGQVMYEIQDWLE 

KNKDPLQQDLELCFKDSSDNWTKLFNDPNIASRAKKGANFITVAAQYKEQLASLMATL 

ETTNPHFVRCIIPNNKQLPAKLEDKWLDQLRCNGVLEGIRITRKGFPNRIIYADFVKR 

YYLLAPNVPRDAEDSQKATDAVLKHLNIDPEQYRFGITKIFFRAGQLARIEEAREQRLG 

SEQTKSDYLKRANELVQWINDKQASLESRDFGDSIESVQSFMNAHKEYKKTEKPPKGQE 

VSELEAIYNSLQTKLRLIKREPFVAPAGLTPNEIDSTWSALEKAEQEHAEALRIELKRQ 

KKIAVLLQKYNRILKKLENWATTKSVYLGSNETGDSITAVQAKLKNLEAFDGECQSLEG 

QSNSDLLSILAQLTELNYNGVPELTERKDTFFAQQWTGVKSSAETYKNTLLAELERLQK 
I EDALHHHHHHHH 
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ATGGATGGTACCGAGGATCCAATTCATGATAGAACTTCAGATTATCACAAATACTTAAA 

AGTTAAACAAGGTGATTCTGATTTATTTAAACTTACTGTTTCAGATAAGAGATACATTT 

GGTATAATCCAGATCCAAAAGAAAGAGATTCATATGAATGTGGTGAAATTGTTTCAGAA 

ACCTCTGATTCTTTCACATTCAAAACCGTTGATGGTCAAGACAGACAAGTCAAAAAGGA 

TGATGCCAATCAACGTAATCCAATCAAATTCGATGGTGTCGAAGATATGTCTC-AATTAT 

CATACCTCAATGAACCAGCAGTTTTCCACAATCTCCGTGTTCGTTACAATCAAGATTTA 

ATTTACACCTATTCAGGTCTCTTTTTGGTTGCCGTCAATCCATTCAAGAGAATTCCAAT 

CTACACTCAAGAGATGGTTGATATCTTCAAAGGTCGTAGAAGAAATGAAGTTGCCCCAC 

ATATTTTCGCCATTTCTGATGTTGCCTATCGTTCAATGTTAGATGATCGTCAAAATCAA 

TCACTCTTAATCACTGGTGAATCTGGTGCTGGTAAGACTGAAAACACCAAAAAGGTCAT 

TCAATATCTTGCATCTGTCGCTGGTCGTAATCAAGCCAATGGTAGTGGTGTATTGGAAC 

AACAAATTCTCCAAGCCAATCCAATCCTTGAAGCTTTTGGTAATGCCAAAACCACCCGT 

AACAACAATTCATCTCGTTTCGGTAAATTCATTGAAATTCAATTCAACAGTGCTGGTTT 

CATTAGTGGTGCTTCAATTCAATCCTACCTTTTAGAGAAATCACGTGTCGTTTTCCAAT 

CTGAAACCGAACGTAATTATCACATTTTCTATCAACTCTTAGCTGGTGCCACCGCCGAA 

GAAAAGAAAGCTCTTCACTTGGCTGGTCCAGAATCATTCAACTACTTAAATCAAAGTGG 

TTGTGTTGATATCAAAGGTGTCTCTGATAGTGAAGAATTCAAAATCACTCGTCAAGCTA 

TGGACATTGTTGGTTTCTCACAAGAAGAACAAATGTCAATCTTTAAGATCATTGCTGGT 

ATCTTACATTTAGGTAACATCAAATTCGAAAAAGGTGCTGGTGAAGGTGCTGTCCTCAA 

AGACAAAACCGCCCTCAACGCTGCTTCAACCGTCTTTGGTGTCAATCCATCAGTCCTTG 

AAAAGGCTCTCATGGAACCACGTATTTTAGCCGGTCGTGATTTAGTTGCTCAACATCTC 

AACGTTGAAAAATCCTCATCATCAAGAGACGCTCTTGTCAAAGCTCTCTATGGTCGTCT 

TTTCCTCTGGTTGGTCAAAAAGATCAACAATGTCCTCTGTCAAGAGAGAAAAGCTTACT 

TTATTGGTGTTTTGGATATTTCAGGTTTTGAAATTTTCAAAGTCAATTCATTCGAACAA 

TTATGTATCAATTATACCAATGAAAAACTCCAACAATTCTTCAATCACCATATGTTCAA 

ATTGGAACAAGAAGAATATCTTAAAGAGAAAATCAATTGGACTTTCATCGATTTTGGTC 

TTGATTCACAAGCCACTATCGATTTAATTGATGGTCGTCAACCACCAGGTATTTTAGCT 

CTTTTGGATGAACAATCTGTTTTCCCAAATGCCACCGATAATACTTTAATCACCAAACT 

CCACAGTCACTTTAGCAAGAAGAACGCCAAATACGAAGAACCACGTTTCTCCAAAACCG 

AATTTGGTGTTACCCATTATGCTGGTCAAGTCATGTATGAGATTCAAGATTGGTTAGAA 

AAGAACAAAGATCCATTACAACAAGATCTCGAACTTTGCTTCAAAGATTCATCAGACAA 

CGTTGTCACCAAACTTTTCAATGATCCAAACATTGCCAGTCGTGCAAAGAAAGGTGCAA 

ACTTTATCACTGTCGCCGCTCAATACAAGGAACAATTAGCCTCACTCATGGCNACCCTT 

GAAACCACCAACCCACATTTCGTTCGTTGTATCATTCCAAACAACAAACAATTACCAGC 

CAAACTCGAAGATAAAGTTGTCCTCGACCAATTACGTTGCAATGGTGTCCTCGAAGGTA 

TTCGTATTACTCGTAAAGGTTTCCCAAATCGTATTATCTATGCCGATTTCGTCAAACGT 



18 



fl 2-01 -2001 



Printed: 14-03-2002 



DRAW 



01100762 



9/9 



TACTATTTATTAGCTCCAAACGTTCCAAGAGACGCTGAAGACTCACAAAAAGCCACCGA 
TGCTGTTCTCAAACATCTTAACATTGATCCAGAACAATATCGTTTCGGTATCACCAAGA 
TTTTCTTCCGTGCCGGTCAATTAGCTCGTATTGAAGAAGCTCGTGAACAACGTCTAGGA 
TCCGAACAAACCAAATCTGATTATCTTAAAAGAGCCAATGAACTCGTTCAATGGATTAA 
CGATAAACAAGCATCACTTGAATCACGTGATTTTGGTGATTCCATCGAATCTGTTCAAA 
GTTTCATGAACGCTCATAAAGAATATAAAAAAACCGAAAAACCACCAAAGGGTCAAGAA 
GTCTCTGAATTGGAAGCTATCTACAATTCATTACAAACTAAATTACGTTTAATTAAACG 
TGAACCATTTGTTGCACCAGCTGGTCTCACTCCAAATGAAATCGATTCCACTTGGTCCG 
CTTTAGAGAAAGCTGAACAAGAACATGCTGAAGCCCTCCGTATTGAACTCAAACGTCAA 
AAGAAAATTGCAGTTCTCTTACAAAAATACAATCGTATTCTCAAGAAACTCGAAAACTG 
GGCCACCACCAAATCTGTCTACCTCGGTTCCAATGAAACCGGTGACAGTATCACTGCTG 
TTCAAGCTAAATTAAAGAATTTAGAAGCTTTTGATGGTGAATGTCAATCATTGGAAGGT 
CAATCAAACTCTGATCTCCTCAGCATTCTTGCTCAATTAACTGAACTCAACTACAATGG 
TGTACCAGAACTCACTGAACGTAAAGATACATTCTTTGCTCAACAATGGACTGGTGTTA 
AATCATCTGCTGAAACCTACAAAAACACTCTTTTAGCTGAACTTGAAAGACTCCAAAAG 
ATTGAAGATGCATTACATCATCATCATCATCATCACCAC 




